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In this paper we determine the percolation threshold for an ar- 
bitrary sequence of dense graphs (G„). Let X n be the largest eigen- 
value of the adjacency matrix of G„, and let G„(p„) be the random 
subgraph of G n obtained by keeping each edge independently with 
probability p n . We show that the appearance of a giant component in 
Gn(Pn) has a sharp threshold at p n — 1/A„. In fact, we prove much 
more: if (G„) converges to an irreducible limit, then the density of the 
largest component of G n (c/n) tends to the survival probability of a 
multi-type branching process defined in terms of this limit. Here the 
notions of convergence and limit are those of Borgs, Chayes, Lovasz, 
Sos and Vesztergombi. 

In addition to using basic properties of convergence, we make 
heavy use of the methods of Bollobas, Janson and Riordan, who used 
multi-type branching processes to study the emergence of a giant 
component in a very broad family of sparse inhomogeneous random 
graphs. 

1. Introduction. In this paper we study percolation on arbitrary se- 
quences of dense finite graphs, where the number of edges grows quadrati- 
cally with the number of vertices. The study of percolation on finite graphs 
is much more delicate than that of percolation on infinite graphs; indeed, 
percolation on finite graphs provides the finite-size scaling behavior of perco- 
lation on the corresponding infinite graphs (see, e.g., Borgs, Chayes, Kesten 
and Spencer [11] for the study of percolation on finite subcubes of 
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The first question one asks is whether there is a percolation phase tran- 
sition. In the case of a finite graph on n vertices, we say that a percolation 
phase transition occurs when the size of the largest component goes from 
being of order o(n) [typically in fact O(logn)] below a certain density to 
order n above that density. Of course, to make this precise, one must con- 
sider a sequence of graphs with n — > oo. The next question one typically 
asks is how the size of the second largest component behaves. In the few 
specific cases studied so far, the second largest component is of order logn 
both below and above the transition; the behavior above the transition is 
much more difficult to prove. Once the existence of the transition has been 
established, one then studies the finite-size scaling (i.e., behavior in n) of the 
width of the transition region, and the size of the largest component within 
that transition window. 

In this paper we establish the existence of a phase transition, including 
the behavior of the largest and second largest components, for a very large 
class of sequences of dense finite graphs. Moreover, we establish the location 
of the transition in terms of spectral properties of these graphs. 

Consider a sequence of dense graphs (G n ) and a sequence of random 
subgraphs G n {p n ) obtained from G n by deleting edges independently with 
probability 1 — p n . We say that the system percolates if G n (p n ) has a giant 
component, that is, a connected component of size 0(|G n |), where \G n \ de- 
notes the number of vertices in G n . As usual, we say that the appearance of 
a giant component has a sharp threshold if there exists a sequence (p n ) such 
that for all e > 0, the random subgraph G n (p n (l — e)) has no giant compo- 
nent with probability 1 — o(l) while G n (p n (l + e)) has a giant component 
with probability 1 — o(l). (Here, and throughout, all asymptotic notation 
refers to the limit as n — > oo.) 

The simplest sequence (G n ) for which this question has been analyzed 
is a sequence of complete graphs on n vertices. The corresponding random 
subgraph is the well-known random graph G n)Pn . Erdos and Renyi [16] were 
the first to show that with p n = c/n, the random graph G n C/ /„ undergoes a 
phase transition at c = 1: for c < 1, all components are of size O(logn) while 
for c > 1 a giant component of size O(n) emerges. Later, the precise window 
of this phase transition was determined by Bollobas [5] and Luczak [19]. 

Other specific sequences were considered in both the combinatorics and 
the probability communities. Ajtai, Komlos and Szemeredi [1] established a 
phase transition for percolation on the n-cube Q n = {0, l} n (see [7, 10] for 
much more detailed estimates on this transition). Borgs, Chayes, Kesten and 
Spencer [11] studied the case when the graphs G n are rectangular subsets 
of Z 2 , and determined both the width of the phase transition window and 
the size of the largest component within this window in terms of the critical 
exponents of the infinite graph 1? . 
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While the question of a phase transition for random subgraphs of gen- 
eral sequences (G n ) was already formulated by Bollobas, Kohayakawa and 
Luczak [7], progress on this question has been rather slow. The few papers 
which deal with more general classes of graph sequences are still restricted in 
scope. See, for example, Borgs, Chayes, van der Hofstad, Slade and Spencer 
[8, 9] where the window for transitive graphs obeying the so-called trian- 
gle condition was analyzed, Frieze, Krivelevich and Martin [17] where the 
threshold for random subgraphs of a sequence of quasi-random graphs was 
analyzed, and Alon, Benjamini and Stacey [2] for results about expander 
graphs with bounded degrees. 

Here we analyze the phase transition for random subgraphs of dense con- 
vergent graph sequences. The concept of convergent graph sequences was 
introduced for sparse graphs by Benjamini and Schramm [3] and for dense 
graphs by Borgs, Chayes, Lovasz, Sos and Vesztergombi in [12] (see also [13]). 
As shown in [14, 15], there are many natural, a priori distinct definitions of 
convergence which turn out to be equivalent. Here we use the following one: 
given two graphs F and G, define the homomorphism density, t(F,G), of F 
in G as the probability that a random map from the vertex set of F into 
the vertex set of G is a homomorphism; a sequence (G n ) of graphs is then 
said to be convergent if t(F, G n ) converges for all finite graphs F. Note that 
any sequence (G n ) has a convergent subsequence, so when studying general 
sequences of graphs, we may as well assume convergence. 

It was shown by Lovasz and Szegedy [18] that if a graph sequence con- 
verges, then the limiting homomorphism densities can be expressed in terms 
of a measurable function, W : [0, l] 2 — > [0, 1], which can therefore be thought 
of as the limit of the graph sequence. Following [14], we call such functions 
and their generalizations graphons. More precisely, a graphon is a bounded 
measurable function W : [0, l] 2 — > [0, oo) with W(x, y) = W(y, x) for all x, y. 
In [15] it is also shown that convergence implies convergence of the nor- 
malized spectra of the adjacency matrices to the spectrum of the limiting 
graphon considered as an operator on £ 2 ([0, 1]). 

Independently of the results above, Bollobas, Janson and Riordan [6] 
introduced a very general model of inhomogeneous random graphs with 
bounded average degree, defined in terms of so-called kernels. Although ker- 
nels are reminiscent of graphons, they are more general; in particular, they 
can be unbounded. One of the aims of [6] was to prove precise results about 
the emergence of the giant component in a general class of random graphs. 

Our main result in this paper says that a convergent graph sequence has 
a sharp percolation threshold, and, moreover, if the limiting graphon W 
is irreducible, then the density of the largest component is asymptotically 
equal to the survival probability of a certain multi-type branching process 
defined in terms of W (see Theorem 1 below). 
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As a corollary of this result, we obtain that the appearance of a giant 
component in an arbitrary sequence of dense graphs (G n ) (convergent or 
not) has a sharp threshold at p n = 1/A n , where A n is the largest eigenvalue 
of the adjacency matrix of G n (see Theorem 2 below). As usual, a sequence 
(G n ) with \G n \ — y co is called dense if the average degree in G n is of order 

e(|G n |). 

To state our results precisely, we need some notation. Given two graphs F 
and G, write hom(i ? , G) for the number of homomorphisms (edge-preserving 
maps) from F into G, and 

t(F, G) = |Gr |F| hom(F,G) 

for its normalized form, the homomorphism density. Following [13], we call 
a sequence (G n ) of graphs convergent if t(F,G n ) converges for every graph 
F. It was shown in [18] that a sequence (G n ) is convergent if and only if 
there exists a symmetric, Lebesgue-measurable function VF:[0,1] 2 — > [0,1] 
such that 

(1) t(F, G n ) -> t(F, W) for every graph F, 

where 



t(F,W) = 




is the homomorphism density of F in W . In this case the sequence is said 
to converge to W; in notation, G n — > W. 

We also need the notion of a weighted graph. For the purposes of this pa- 
per, a weighted graph G on a vertex set V is a symmetric function /3 : (v, w) i— > 
/3 VW from V x V to [0, oo) with j3 vv = for every v G V . (We thus do not 
allow vertex weights, and also restrict ourselves to nonnegative edge weights, 
instead of the more general case of real-valued edge weights considered in 
[14, 15].) Graphs correspond naturally to weighted graphs taking values in 
{0,1}. The definitions of t(F,G) and of convergence extend naturally to 
weighted graphs: if F is a graph on [k] and G is a weighted graph on V, 
then 

t(F,G) = \G\-\ F \ £ J] ? ViVj . 

vi,...,v k €V ijeE(F) 

Let (G n ) be a sequence of weighted graphs. We write f3ij(n) for the weight 
of the edge ij in G n , suppressing the dependence on n when this does 
not lead to confusion. We shall assume throughout that |G n | — > co, that 
/?max = su Pi,j, n Pij( n ) < 00 an d that (G n ) is convergent in the sense that 
t(F,G n ) converges for all unweighted graphs F, although we shall remind 
the reader of this assumption in key places. As shown in [14], the results of 
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[18] immediately generalize to such sequences, implying the existence of a 
graphon W such that G n —¥ W in the sense of (1). Let us note that in the 
context of unweighted graphs, graphons are usually defined to take values in 
[0, 1]. For weighted graphs with unbounded edge- weights, one could also con- 
sider unbounded limit functions W; we do not consider this extension here. 
One can also consider signed edge weights, in which case the appropriate 
limit objects are signed graphons, that is, (bounded) symmetric measurable 
functions from [0, l] 2 to R. 

Given a signed graphon W, the cut norm of W is 

||W||n = sup 

A,Bc[0,l] 

where the supremum is taken over all measurable sets. A rearrangement W 1 ^ 
of a graphon W is the graphon defined by W^(x,y) = W (<f>(x) , 4>{y)) where 
(f>: [0,1] — >• [0,1] is a measure-preserving bijection. Finally, the cut metric is 
the pseudometric on graphons defined by 

S n (Wi,W2) = M\\W 1 -Wt\\n. 

It is proved in [14] that G n — > W if and only if 
(2) 8 D (W Gn ,W)^0, 

where Wg„ denotes the piecewise-constant graphon naturally associated to 
the weighted graph G n . 

Given a weighted graph G, let G(p) be the random graph on V(G) in 
which edges are present independently, and the probability that ij is an 
edge is mm{p/3ij, 1}. We shall lose nothing by assuming that pfyj < 1, so 
we shall often write pfyj for min{p/3jj, 1}. Alternatively, as in [6], we could 
take pfyj to be a Poisson "edge intensity," so the probability of an edge 
is 1 — exp(— pfiij); this makes no difference to our results. Our aim is to 
study the giant component in the sequence G n (c/\G n \). To do this, we shall 
consider a certain branching process associated to the graphon cW. 

As in [6], there is a natural way to associate a multi-type branching process 
3Ly/ to a measurable PF:[0,1] 2 — >-R + : each generation consists of a finite 
set of particles with "types" in [0,1]. Given generation t, each particle in 
generation t has children in the next generation independently of the other 
particles and of the history. If a particle has type x, then the types of its 
children are distributed as a Poisson process on [0, 1] with intensity measure 
W(x, y) dy, where dy denotes the Lebesgue measure. In other words, the 
number of children with types in a measurable set A C [0, 1] is Poisson with 
mean ^ y&A W{x,y) dy, and these numbers are independent for disjoint sets 
A. The first generation of 3Lw consists of a single particle whose type x is 



AxB 



W(x, y) dx dy 



G 



BOLLOBAS, BORGS, CHAYES AND RIORDAN 



uniformly distributed on [0, 1]. Often we consider the same branching process 
but started with a particle of a fixed type x: we write Xw(x) for this process. 
If, as we shall always assume in this paper, W is bounded, then 



the expected number of children of a particle of type x, is bounded by ||W||oo; 
in particular, this expected number is finite. Thus every particle always has 
a finite number of children, and the total size of £yy is infinite if and only 
if the process Xw survives for ever. 

Writing \3Ly/\ for the total number of particles in all generations of the 
branching process 3Ly/, let p(W) = P(|5Ew| = oo) be the "survival probabil- 
ity" of %w, and let p(W; x) be the survival probability of Xw(x), the process 
started with a particle of type x. From basic properties of Poisson processes 
we have 



In general, the functional equation (3) may have several solutions. It is a 
standard result of the theory of branching processes (proved in the present 
context in [6], for example) that p(W;x) is given by the largest solution, 
that is, the pointwise supremum of all solutions. 
Let T\y be the integral operator defined by 



From Theorem 6.1 of [6], we have p(W) > if and only if ||TV|| > 1> where 
1 1 TV 1 1 is the L 2 -norm of T w . 

We shall show that the condition G n — > W is strong enough to ensure 
that the branching process captures enough information about the graph to 
determine the asymptotic size of the giant component in G n (c/\G n \). For 
this we need one more definition, corresponding roughly to connectedness. 

A graphon W is reducible if there is a measurable A C [0, 1] with < 
p(A) < 1 such that W(x, y) = for almost every (x,y) £ Ax A c . Otherwise, 
W is irreducible. Using the equivalent condition (2) for convergence, together 
with Szemeredi's lemma, it is easy to show that if G n —¥ W with W reducible, 
then the vertex set of each G n may be partitioned into two classes so that 
the induced graphs H n and K n converge to appropriate graphons W\ and 
W2, with o(|G n | 2 ) edges of G n joining H n to K n . In other words, the graphs 




(3) p(W; x) = l- exp ( - J W(x, y)p(W- y) dy 1 

for every x, and from the definitions of X\y(x) and 3Ly/ we have 
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G n may be written as "almost disjoint" unions of convergent sequences H n 
and K n . In the light of this observation, it will always suffice to consider the 
case where W is irreducible. 

Henceforth, for notational convenience, we shall assume that G n has n 
vertices; we shall often take the vertex set to be [n] = {l,2,...,n}. Note 
that we do not require G n to be defined for every n: all results will extend 
trivially to the general case \G n \ — > oo by considering subsequences. Let 
(X n ) be a sequence of nonnegative random variables and (a n ) a sequence 
of nonnegative reals. As usual, we say that a sequence of events S n holds 
with high probability, or whp, if P(£ n ) — > 1 as n — > oo. As in [6], we write 
X n = o p (a n ) if X n /a n converges to in probability, X n = 0(a n ) whp if 
there is a constant C such that X n <Ca n holds whp, and X n = Q(a n ) whp 
if there are constants < C\ < C2 such that C\a n < X n < Cia n holds whp. 
Note that the assertion X n = 0(a n ) whp is stronger than the assertion that 
X n /a n is bounded in probability [sometimes written X n = O p (a n )]. 

Theorem 1. Let (G n ) = {^ij{n))i^^[ n ] be a sequence of weighted graphs 
with \G n \ = n and /3 max = sn Pi,j,nPij{ n ) < 00 converging to a graphon W . 
Let c > be a constant, and let C\ = C\(n) denote the number of vertices 
in the largest component of the random graph G n {c/n), and C2 = £2(71) the 
number of vertices in the second largest component. 

(a) If c < \\T\vW~ 1 , then C\ = o p (n). 

(b) If c> \\TwW~ 1 , then C\ = Q(n) whp. More precisely, for any constant 
a < (c||2V|| — l)/(c/3 max ) we have C\ > an whp. 

(c) If W is irreducible, then C\/n A p(cW) and C2 = o p (n). 

In the result above, we may take c = 1 without loss of generality, rescaling 
the edge weights in G n . The heart of the theorem is part (c); as we shall see 
later [in the discussion around (14)], part (b) follows easily. 

The first two statements of Theorem 1 immediately imply that an arbi- 
trary sequence of dense graphs has a sharp percolation threshold. 

Theorem 2. Let (G n ) be a sequence of dense graphs with \G n \ = n, let 
\ n be the largest eigenvalue of the adjacency matrix of G n and let p n = 
min{c/A n , 1}. 

(a) If c<l, then the largest component of G n (p n ) is of size o p (n). 

(b) Ifol, then the largest component of G n (p n ) has size 0(n) whp. 

Proof. Theorem 2 follows immediately from Theorem 1 and the fact 
that any sequence of weighted graphs with uniformly bounded edge weights 
has a convergent subsequence. Indeed, to see (a), suppose for a contradiction 
that c < 1 , and for some e > there is a subsequence (which we again denote 
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by G n ) such that with probability at least e, the largest component of G n (p n ) 
has at least en vertices. Let G n be the sequence of weighted graphs obtained 
by weighting each edge in G n by n/\ n . Since (G n ) is dense, we have A n = 
0(n), so the edge weights in (G n ) are bounded above. By compactness, 
we may assume that the homomorphism densities t(F,G n ) converge, so by 
the results of [14], we may assume that G n converges [in the sense of (1)] 
to some graphon W. Since the largest eigenvalue of the adjacency matrix 
of G n is equal to n, we have that \\Tw\\ = 1, so by Theorem 1 the largest 
component of G n (p n ) = G n (c/n) has size o p (n), a contradiction. The proof 
of (b) proceeds along the same lines. □ 

For convergent sequences of weighted graphs converging to an irreducible 
graphon W, we shall prove stronger results about the sizes of the small 
components in the noncritical cases. This time we renormalize by scaling 
the edge weights, taking c = 1. 

Theorem 3. Let (G n ) be a sequence of edge-weighted graphs with \G n \ = 
n and /3 max = sup, j n fyj (n) < oo, and suppose that G n — > W. 

If ||7V|| < 1, then there is a constant A such that C\{G n {l/n)) < j41ogn 
holds whp. 

If \\Tw\\ > 1 and W is irreducible, then there is a constant A such that 
C2{G n (l/n)) < Alogn holds whp. 

It is easily seen that ||7V|| > 1 does not in itself imply that the second 
component has O(logn) vertices; indeed, for an appropriate sequence (G n ) 
in which each G n is the disjoint union of two graphs, H n and K n , with (H n ) 
and (K n ) convergent, H n (l/n) and K n (l/n) both have giant components 
whp, so the second component of G n (l/n) has 0(n) vertices whp. Note also 
that adding, say, 6(n 3 / 2 ) random edges to G n running from H n to K n will 
almost certainly join [in G n (l/n)] any giant components in H n (l/n) and 
K n (l/n) while preserving the condition G n — > W. Hence we cannot expect 
to find the asymptotic size of the giant component in the reducible case. 

Let iVfe(G) denote the number of vertices of a graph G that are in com- 
ponents of size (number of vertices) exactly k. The basic idea of the proof 
of Theorem 1 is to consider components of each fixed size; the key lemma 
we shall need is as follows. 

Lemma 4. Let W:[0, l] 2 — > [0, oo) be a graphon, and let (G n ) = 
(Pij{n))i j e \ n ] be a sequence of weighted graphs with G n — > W and /3 max = 
supj j n /3ij (n) finite. For each fixed k we have 

(4) -N k (G n (l/n))^n\X w \ = k) 

n 

as n — )• oo . 
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Lemma 4 tells us that we have the "right" number of vertices in small 
components; we shall then show that most of the remaining vertices are in 
a single large component. Of course, as with any such branching process 
lemma, the proof actually gives a little more: for any finite tree T, 1/n 
times the number of tree components of G n (l/n) isomorphic to T converges 
in probability to a quantity that may be calculated from the branching 
process. In particular, considering a rooted tree T* , the normalized number 
of vertices v in tree components that are isomorphic to T* with v as the root 
converges to the probability that the branching process %w is isomorphic 
to T* , when Xw is viewed as a rooted tree in the natural way. 

Lemma 4 will be proved in the next section; the proof of Theorem 1 is 
then given in Section 3. Theorem 3 is proved in Section 4 in two separate 
parts. 

Before turning to the proofs, let us remark on the relationship of the 
present results to those of [6]. It might at first sight appear that Theorem 1 
(and hence Theorem 2) would follow simply from Theorem 3.1 of [6]. Indeed, 
given a sequence (G n ), the sequence of random subgraphs G n (c/n) can be 
seen as an instance of the model of [6] with a sequence K n of kernels given 
by K n = cWa n • Passing to a subsequence, we can assume that these kernels 
converge in the cut metric. But the results of [6] require pointwise conver- 
gence almost everywhere, which is much stronger. Considering in particular 
the case where the G n are unweighted graphs, the kernels K n take only the 
values c and 0, so it is clear that in general they will have no subsequence 
that converges pointwise. 

One could of course ask whether Theorem 3.1 of [6] may be strengthened 
by relaxing the pointwise convergence condition to convergence in the cut 
metric. Ignoring complications due to the more general setting considered 
in [6], the answer for bounded kernels turns out to be yes, but that is es- 
sentially the content of the present paper. Indeed, as noted above, Theorem 
1 may be viewed as a very special case of such a strengthening, for kernels 
of a special form (piecewise constant on each 1/n by 1/n square) and the 
corresponding vertex space. It is easy to extend this to general sequences of 
uniformly bounded kernels as long as, in the terminology of [6], we restrict 
ourselves to the vertex space in which the types of the vertices are inde- 
pendent and uniformly distributed on [0, 1]. We can state such an extension 
without reference to the more complicated definitions in [6]. 

Given a graphon W, for n > supVF let G(n,W) be the random (sparse) 
graph on [n] obtained as follows: first choose X\, . . . ,X n independently and 
uniformly from [0,1]. Then, given X\, . . . ,X n , join each pair ij, i < j, with 
probability W(Xi,Xj)/n, independently of the other pairs. 

Theorem 5. Let W n be a sequence of uniformly bounded graphons and 
W a graphon, and suppose that d\j(W n ,W) — > 0. Then the conclusions of 
Theorem 1 hold with G n (c/n) replaced by G(n,cW n ). 
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Proof. This is essentially immediate from Theorem 4.7 of [14] and The- 
orem 1. Indeed, taking X\, . . . ,X n independent and uniform on [0,1], let 
H n = H(n,W n ) be the (dense) weighted graph on [n] with weights (3ij(n) = 
W n (Xi,Xj). Theorem 4.7 of [14] states that with probability at least 1 — 

e -n 2 /(21og 2 n) we have 

Sn(W n , W Hn ) < iL_ sup W n < 



\/log2 n ^og 2 n 

where M = sup n supW n < oo. It follows that Sa(W n ,WH n ) — > a.s. Since 
(5n(W n , W) — > 0, we thus have H n — > W a.s. To construct G(n, W n ), we con- 
dition on the sequence H n , and then realize G(n,W n ) as H n (c/n). In this 
conditioning we may assume that H n — > W, so the result follows by Theorem 
1. □ 



Note that while Theorem 5 greatly strengthens Theorem 3.1 of [6] by 
relaxing the convergence condition to one that can always be applied to a 
subsequence, it is much weaker in other ways: we must restrict to uniformly 
bounded graphons/kernels, and there is no obvious way to handle general 
vertex spaces. 

2. The number of vertices in small components. Let us first prove a 
slightly weaker form of the special case k = 2 of Lemma 4, calculating the 
expected number of isolated edges in G n (l/n). The only extra complications 
in the general case will be notational. Let 

dv — ^ ^ Pvw 

w 

denote the "weighted degree" of a vertex v in the weighted graph G n . Note 
that for v fixed, the quantity d v = d v {n) is determined by G n , so it is deter- 
ministic. 

Let v and w be two vertices of G n chosen independently and uniformly at 
random, independently of which edges are selected to form G n (l/n). Given 
a random variable X that depends on G n , v and w, but not on which edges 
are selected to form G n (l/n), we shall write E^^ for the expectation of X 
over the choice of v and w. We define E„ similarly. 

Let X be the vector-valued random variable X = (d v /n,d w /n, (3 VW ). Note 
that HXjloo < sup v w n j3 vw (n) < oo. For nonnegative integers t\ and let 

(5) X tut2 =E vw ((d v /n)^(d w /n) t2 /3 vw ) 

denote the (t\,t2, l)st joint moment of the triple X. Then 

V w 
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d u> ui,...,ttt! xi,...,xt 2 i i 

= n tl+t2+2 t(St 1) t 2 > G n ) , 

where is the "double star" consisting of an edge with t\ extra pen- 

dent edges added to one end and £2 to the other. Note that the summation 
variables are not required to be distinct, and that t(F,G n ) counts homo- 
morphisms, not injections. Since G n — > W, it follows that 

(6) X tljt2 ^t(S tl)t2 ,W) 
as n — >■ 00. 

For a given pair of vertices v, w, the probability [when we make the 
random choices determining G n (l/n)] that vw forms an isolated edge in 
G n (l/n) is exactly 

Pvw TT ( -1 Pvz \ 1 -. fiwz \ 

Pvw = 1 1 • 

n , \ n J \ n 

Note that the probability that v is one end of an isolated edge is 

^ ^ Pvw — ri^"wPvw ! 

so, with both v and w random, we have 

-E(jV 2 (G„(l/n))) = P(t; in isol. edge) = E vw {n Pvw ). 
n 

Since the /3's are bounded, we have 

/r7 \ Pvw ( d v d w 

(7) p vw ~ exp 

n \ n n 

Let D vw = d v /n + d w /n. Note that with v and w fixed, the quantity D vw = 
D vw (n) is determined by G n , so it is deterministic. For every v and w we 
have 



P vw exp(-D vw ) = y^(-l) 



t 

vw 



Taking v and w uniformly random, for each fixed n we have 

(8) E VW (P VW exp(-A, w )) = f2(-l) t KM ™ DL) . 

t=o *• 
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As n — > oo, from (5) and (6) we have 

E vw (/3 vw Dl w ) = E vw ((3 vw (d v /n + d w /nf) 



ti+t 2 =t v 7 t!+ta=* 

It is easy to see that the n — > oo limit may be taken inside the sum in (8); 
indeed, D vw is bounded by 2/3 max , so the tth summand in (8) is bounded by 
C t = /3 m ax(2/3 max )Vi!- Since J2t C t = Anax exp(2/3 max ) < oo, the sum in (8) 
is absolutely convergent, uniformly in n. Hence, 



vw ) 



t tp ( a ( n ^ \ 1 \t^ m n-^oo^ , vw(PvwD v 
hm E vw (/3 vw exp(-D vw )) = } (-1) 

t=0 

=E^ E 

Putting the pieces together, we have expressed the limiting expectation 
of N 2 (G n (l/n)) in terms of W: 

-E(N 2 (G n (l/n))) = E vw (np vw ) ~E vw (/3 vw exp(-D vw )) 
n 

-E^r 1 E (1)ks,„,„w) 

t=o ' h+t 2 =t v 7 

t 1= ot 2 =o 1 z 
Recalling that we write X(x) = J^W(x,y) dy, we have 

t(S tltta ,W)= [ [w(x,y)\(x) h \(y) t *dxdy, 

J x J y 

so the final quantity above is simply 
t 1= ot 2 =o x y 

As W is bounded, using dominated convergence we may take the sums inside 
the integral, obtaining 

I ! W(x,y)e- x{x) e~^ y) dxdy = F(\X w \ = 2). 

J x Jy 

We have thus proved a weak form of (4) for k = 2, namely convergence in 
expectation: ±E(iV 2 (G n (l/n))) ->■ F(\X W \ = 2). 
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Convergence in expectation for general k is essentially the same, although 
we must count trees with each possible structure separately. Note that we 
need only consider tree components: if N[(G) denotes the number of vertices 
of a graph G that are in components of size k that are not trees, then N' k (G) 
is certainly bounded by k (or in fact k/3) times the number of vertices of G 
in cycles of length at most k. In G n (l/n), this latter quantity has expectation 
at most 

k 

^X(Anax/™)^ < /cmax{l,/3^ ax }, 
e=3 

so we certainly have 

(9) E{N' k (G n (l/n))) < fc 2 max{l,^ ax } = o(n) 

as n —> oo with k fixed. After this preparation, let us now turn to the proof 
of Lemma 4. 



Proof of Lemma 4. Let T be a rooted tree on k vertices. Let aut(T) 
denote the number of automorphisms of T as a rooted tree. Thus, if Ti, . . . , T r 
are the "branches" of T, then aut(T) = /rjaut(Tj), where the factor / is 
the product of a factor j\ for each (maximal) set of j isomorphic branches 
Ti. 

The branching process 3Lw may be naturally viewed as a rooted tree, by 
joining each particle to its parent and taking the initial particle as the root. 
We write 3Cw — T if this tree is isomorphic to T as a rooted tree. Note that 

(10) F(\X w \=k) = ^2F(3L w ^T), 

T 

where the sum runs over all isomorphism classes of rooted trees on k vertices. 
Realizing T as a graph on {1,2,. . . ,k}, one can show that 



aut(T) J X1 
For example, the stronger statement 



x k j = i ijeE(T) 



\ )Jx 2 J Xhi=x ijeE(T) 

may be proved by induction on the size of T, noting that Xw — T holds if 
and only if, for each isomorphism class of branch Tj of T, we have exactly 
the right number of children of the initial particle whose descendants form 
a tree isomorphic to Tj. 
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Let T be a tree on {1, 2, . . . , k}, which we shall regard as a rooted tree with 
root 1, and let v = (v±, . . . , v^) be a fc-tuple of vertices of G n . If vi,...,Vf. 
are distinct, let p v ,T denote that probability that {vi, . . . ,Vk} is the vertex 
set of a component of G n (l/n) with v^Vj an edge of G n (l/n) if and only if 
ij is an edge of T. (Note that this condition is stronger than the component 
being isomorphic to T.) If two or more Vi are the same, set p v ,T = 0. As 
before, let d v = Y2weV(G n ) @ vw denote the "degree" of v in G n ; also, let 

Pv,T = PviVj ■ 

ij&E{T) 

Arguing as for (7), since (3 n 



Pv,T 

Furthermore, taking v±, . . . ,Vk uniformly random, and recalling from (9) that 
we may ignore nontree components, we have 

-E(JV fc (G n (l/n))) 
n 

= ^^P(random vertex v is root of cpt isom. to T) + o(l) 

T 

= l nT.^T) E Pv,T + o(l), 
T K ' vi,...,v k 

where the sum runs over all isomorphism classes of k-vertex rooted trees 
T, and the factor 1/ aut(T) accounts for the number of labelings of a tree 
component isomorphic (as a rooted tree) to T with a given vertex v\ as the 
root, which gives the number of distinct fc-tuples (v±, . . . ,Vk) corresponding 
to a certain rooted component. Putting the above together, we have 

±E(N k (G n (l/n))) = > ] > ] exp [ - > ] ^ ] + o(l) 




where the expectation is over the uniformly random choice of V\ , . . . , . 

The rest of the calculations are as before: it is easy to check that for 
nonnegative integers t\, . . . , we have 

n^> +fc E v ^ T H (^) = n^ +k t(T t ,G n ), 
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where Tt is the graph formed from T by adding t\ pendent edges to each 
vertex i, so 

k 

t(T t ,G n )^t{T t ,W) = f ■■■ ! J] W{x i ,x j )'\lX{x i ) ti . 

As all relevant sums are uniformly absolutely convergent, we can expand the 
term exp(— ^ d v Jn) as a sum of terms of the form Yli(d Vi / n)^ , sum and 
take limits as before, finally obtaining 




+ o(l). 



But, from (11), the final summand is just P(Xvf — T), so, from (10), 



-E(N k (G n (l/n))) -+ F(\X W \ = k). 
n 

To complete the proof of Lemma 4 it remains to give a suitable upper 
bound on the variance. Let N> k (G) denote the number of vertices of a 
graph G in components of size at least k, and set N> k = N> k (G n (l/n)). We 
shall show that 

(12) E((N> k /n) 2 ) < (E(N> k /n)f + o(l) 

as n — > oo. This will imply that N> k /n has variance o(l), and hence that 
N k (G n (l/n)) = N> k — N> k+ i is concentrated about its mean. 

Writing c(v) for the number of vertices in the component of G n (l/n) 
containing a given vertex v, and letting v and w be independent random 
vertices of G n (l/n), (12) is equivalent to 

(13) P(c(u) > k, c(w) >k)< F(c(v) > k)F(c(w) >k) + o(l). 

But this is more or less immediate from the fact that F(d(v, w) < 2k) = o(l), 
where d(v,w) denotes graph distance in G n (l/n). Indeed, let us first fix v 
and w. If c(v) > k, c(w) > k and d(v, w) > 2k, then we can find disjoint sets of 
edges E v , E w C G n (l/n) such that the presence of all edges of E v in G n {l/n) 
is sufficient to guarantee that c(v) > k, and similarly with v replaced by w. 
[In fact, if d(v,w) > 2k, then minimal witnesses for the events c(v) > k and 
c(w) > k must be disjoint.] In other words, writing, as usual, A □ B for 
the event that two (increasing) events A and B have disjoint witnesses, if 
d(v,w) > 2k, then whenever the events A = {c(v) > k} and B = {c(w) > k} 
hold, so does AID B. Hence 

F(A DB)< F(d(v, w) < 2k) + F(A □ B, d(v, w) > 2k) 

<F{d{v,w) <2k) +F(AU B). 
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By the van den Berg-Kesten inequality [4] (for a more general inequality, 
see Reimer [20]), F(ADB) is at most F(A)¥(B), so 

P(c(v) >k,c(w) >k)<W(c(v) >k)F(c(w) >k) + ¥(d(v,w) <2k). 

Choosing v,w uniformly at random and using the fact that K vw F(d(v,w) < 
2k) = o(l), we obtain the bound (13) and hence (12), completing the proof 
of Lemma 4. □ 

Lemma 4 has an immediate corollary showing that the "right" number of 
vertices are in "small" components, as long as "small" is defined suitably. 

COROLLARY 6. Let W:[0, l] 2 — > [0, oo) be a graphon, and let (G n ) = 
(/3ij{n))i j e \ n i be a sequence of weighted graphs with G n — > W and /3 max = 
supi j n Pi j(n) finite. Then 

-N >0J (G n (l/n))^p(W) 
n 

as n — ^ oo whenever uj = u)(n) tends to infinity sufficiently slowly. 

Proof. By Lemma 4 we have \N k (G n (l/n))/n - ¥{\X W \ = k)\ 4 for 
every fixed k. It follows that there is some function uj = uj(n) with uj(n) — » oo 
such that Ylk=i \Nk(G n (l/n))/n — ¥(\Xw\ = k)\ — > 0. Reducing uj if neces- 
sary, we may and shall assume that uj(n) = o(n). Let us say that a component 
is small if it has size at most u[n) and large otherwise. Note that the number 
N<u(G n (l/n)) of vertices in small components satisfies 

1 

-N<u{G n (l/n)) =5>(|£ W | =k) + op(l) = F(\Xw\ < w(n)) + o p (l) 
fc=i 

= F(\X W \ <oo) + Op(l). 

(For the last step, recall that X\y is defined without reference to n.) Hence, 
the number of vertices in large components is asymptotically np(W) = 
nF(\Xw\ = oo), as claimed. □ 

In the case where ||?V|| < 1, Theorem 1 follows from Corollary 6: we have 
piW) = 0, so there are o p (n) vertices in large components, and the largest 
component has size o p (n). When ||TV|| > 1, it remains to show that almost 
all vertices in large components are in a single giant component. For this we 
shall use a sprinkling argument. 
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3. Joining up the large components. In the light of Corollary 6, to prove 
the heart of Theorem 1, namely part (c), it remains to show that when W is 
irreducible, almost all vertices in "large" components are in fact in a single 
giant component. Before doing this, we shall show that part (b) of Theorem 
1, concerning the reducible case, follows from part (c). As the reducible case 
is rather uninteresting, we shall only outline the argument, omitting the 
straightforward details. 

If W is reducible, it is easy to check that W may be decomposed into a 
finite or countably infinite sequence Wi, Wjj, ... of irreducible graphons in a 
natural sense. (For a formal statement and the simple proof, see Lemma 5.17 
of [6].) Here each graphon Wi is defined on A* x Aj, rather than on [0, l] 2 , 
for some disjoint sets A C [0,1]. If ||?V|| > 1, then there is an i for which 
\\Twt II > 1- in f ac t> since W is bounded (by /3 max < oo), only finitely many of 
the Wj may have \\Tw- \\ > 1, so there is an i with ||TVJ| = ||TV||. As noted 
in the Introduction, if G n — > W it is easy to check that we may find induced 
subgraphs H n of G n with \H n \ = (m(Aj) + o(l))n and H n — > W{. Since Wi 
is irreducible, assuming the irreducible case of Theorem 1, whp the graph 
H n (c/n), and hence G n (c/n), will contain a component with @(n) vertices. 

To obtain the explicit constant in Theorem 1(b), it remains only to show 
that if W (here Wi, rescaled to [0, l] 2 ) is an irreducible graphon with \\Tw\\ > 
1, then 

(14) p(W) > 11 



This crude bound is implicit in the results in [6]: indeed, Tyy has a positive 
eigenfunction ip (see [6], Lemma 5.15) with eigenvalue A = ||?V||. Since 



\\T w \\1>(x) = \il>(x) = (T w 1>)(x)= / W(x,y)iP(y)dy<\\W 



U 



we have ||Tvj/||||V'||oo ^ ll^'lloollV'lli- From [6], Remark 5.14, we have 

o(w) > 

P{ \\Tw\\ Woo' 

which then implies (14). 

In the light of the comments above, from now on we assume that W 
is irreducible. In joining up the large components to form a single giant 
component, we must somehow make use of this irreducibility. By an (a, b)-cut 
in W, we shall mean a partition (A, A c ) of [0, 1] with a < p{A) < 1 — a such 
that f AxA c W < b. We start with a simple lemma showing that irreducibility 
[no (a,0)-cut for any < a < |] implies an apparently stronger statement. 
Recall that our graphons are bounded by definition. 
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Lemma 7. Let W be an irreducible graphon, and let < a < % be given. 
There is some b = b(W,a) > such that W has no (a,b)-cut. 

PROOF. Define a measure v on X 2 = [0, l] 2 by setting v(U) = J u W(x, 
y) dxdy for each (Lebesgue-)measurable set U C X 2 . Renormalizing, we may 
and shall assume that W(x,y) < 1 for every (x,y) € X 2 , so that v{B xC)< 
fi(B)/j>(C). As W is irreducible, we also have w = v{X 2 ) > 0. 

Suppose that the assertion of the lemma is false. Then there is a sequence 
(Ai, A1) of pairs of complementary subsets of X such that a < n(Ai) < 1 — a 
and the ^-measure of the cuts Cj = (Ai x AI) U (Af x Ai) tends to 0. By 
selecting a subsequence, we may assume that v(Ci) < 2~ l ~ 1 w for every i > 1. 

For m > 1, let V m be the set of atoms of the partitions V% = (Ai,A^), 
i = l,...,m. Thus D 6 V m if and only if D = f| ™ =l B h with i% = A { or A? 
for each i. Similarly, write £ n for the collection of atoms of the partitions 
V n ,V n+ i,.... Since 

m 

x 2 = (Jau U pxB) 

i=l DeV m 

we have 

m 

i=i DeVm Dev m 

<w/2 + max fi(D) ^ /j,(D) = w/2 + max /i-(-D). 
Dev m 

This shows that for each m > 1, we can find a D m G D m with n(D m ) > u;/2. 
Clearly, if m < n and G P n , then there is a (unique) € P m , with D 
D' n . Since each T> m is finite, by a standard compactness argument (repeated 
use of the pigeonhole principle) we may assume that D\ D -D2 ^ D% D Let 

= f|~ =1 D m . Then ^ G £i and w/2 < /i^i) < 1 - a. For n > 1, let £ n be 
the atom in f n containing then E± C E2 <Z ■ ■ ■ and w/2 < fi(E n ) <1 — a 
for every n. Finally, set E 1 = U^Li so that w/2 < n(E) < 1 — a. 

We claim that this set E shows that W is reducible. Indeed, for any n, 
x £ E implies there is an m > n with x £ E m . Thus 



E x £ c c |J (E m x 

Since E m x £^ C U£m we nave 

00 00 
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so 



1/ 



(E x E c ) < inf > v(E m x E c m ) < inf 2 



-n+l 



W = 



n z — * n 



contradicting our assumption that W is irreducible. □ 
We are now ready to complete the proof of Theorem 1. 

Proof of Theorem 1. As before, we normalize so that c = 1. As noted 
at the start of the section, we may assume that W is irreducible. 

By Corollary 6, there is some to = to(n) with to(n) — s> oo and u{n) = o{n) 
such that 



In particular, since the size C\ of the largest component of G n (l/n) is at most 
the maximum of to and N >ul (G n (l/n)), for any e > we have C\ < (p(W) + 
e)n whp. Similarly, the sum of the sizes of the two largest components is 
at most (p(W) + 2e)n whp. Since p(W) = if ||7V|| < 1, it remains only to 
show that, if ||7V|| > 1 and W is irreducible, then C\ > (p(W) — e)n holds 
whp. 

Theorem 6.4 of [6] which, like all results in Sections 5 and 6 of that paper, 
applies to all graphons (rather than the more restrictive kernels considered 
elsewhere in [6]), tells us that if (Wk) is a sequence of graphons with Wk 
tending up to W pointwise, then piWk) — > p(W). In particular, we have 
p((l — 5)W) — > piW) as 8 — > 0. (If we are only interested in the existence of 
a giant component, rather than its size, then we may use instead Theorem 
6.7 in [15].) It thus suffices to show that for any 5 > 0, whp we have 



say. In doing so we may of course assume that 5 is small enough that 



say. We shall also assume that 5 < 1/100. 

Let G' = G n ((l — S)/n) be the (unweighted) graph on [n] in which the 
edges are present independently, and the edge ij is present with probability 
(1 — 5)f3ij/n, where j3y = j3ij(n) is the weight of ij in G n . We may regard G' 
as G' n (l/n), where G' n is obtained from G n by multiplying all edge weights 
by 1 — 5. Since G' n — > (1 — 8)W , by Corollary 6 there is some oj(n) — > oo with 
to(n) = o{n) such that 



-N >u (G n (l/n)) A p(W) = F(\X W \ = oo). 



C 1 /n>p((l-5)W) -35, 



(15) 



p({l-5)W) > 10(5, 



(16) 



N > UG')/n>p((l-5)W)-5 



holds whp. 
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By an (a, b)-cut in an n-vertex weighted graph G we shall mean a partition 
of the vertex set of G into two sets X , X c of at least an vertices such that 
the total weight of edges from X to X c is at most bn 2 . By Lemma 7, there 
is some b > such that W has no (<5, 26)-cut. We may and shall assume that 
b < 1/10, say. Since G n — > W in the cut metric [see (2)], it follows that if n is 
large enough, which we shall assume from now on, then G n has no (5, 6)-cut. 

The graph G n (l/n) may be obtained from G' by adding each nonedge ij 
with a certain probability s^, independently of the other nonedges, where 
Sij > SPij/n, and the inequality is strict unless /3y = 0. In fact, we shall add 
each nonedge ij with probability exactly Sflij/n; the correction from this 
value to the true value of only works in our favor. 

Let us condition on G' , assuming as we may that (16) holds. Let C\,. . . ,C r 
list the "large" components of G' , that is, the components with more than oj 
vertices. To complete the proof of Theorem 1, it suffices to show that, whp, 
all but at most 2Sn vertices of IJCj lie in a single component of G n (l/n). 
Let B be the "bad" event that this does not happen, so we must show that 
¥(B) = o(l). Since G' is a subgraph of G n (l/n), whenever B holds there is 
a partition X UY of {1,2, ... ,r} such that G n (l/n) contains no path from 

C x = Uiex d to C Y = U e y with \ c x\, \Oy\ > 25n. 

Given a weighted graph G, with edge weights f3 vw , for W C V(G) and 
v £ V(G), we write 

e(v,W) = e G (v,W)= ^ f3 vw . 

Similarly, for V, W CV{G), 

e(V,W) = e G (V,W) = Y,Yl P™' 

v&VwGW 

Unless stated otherwise, the quantities e(v, W) and e(V, W) will refer to the 
weighted graph G = G n . 

Fix G' (and hence C\,...,C r ) and a partition X, Y of {1,2, ...,r} for 
which \Cx\i \Cy\ > 25n. We shall inductively define an increasing sequence 
So, S±, . . . , Sz of sets of vertices of G n , in a way that depends on Cx and 
on G n , but not on the "sprinkled" edges of G n (l/n) \ G' . We start with 
<5o = Cx, noting that |<So| ^ 28n. We shall stop the sequence when \St\ first 
exceeds (1 — 5)n. Thus, in defining St+i from St, we may assume that bn < 
\St\ < (1 — 5)n. Since G n has no (5, 6)-cut, we have 

Y J <v,S t ) = e{S c t ,St)>bn 2 . 

v£S t 

Let 



T t+1 = {v<£S t :e(v,S t )>bn/2}. 
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As all edge weights in G n are bounded by /3 max , we have e(v, St) < /3 m ax|<St| < 
Pmaxn for any v, so 

bn 2 < e(S^S t ) < —\V(G n )\(S t UT t+1 )\ + p max n\T t+1 \ < — + /3 max n|T m |. 
Hence, |Ti + i| > ^ n . Set S^+i = St UT^+i, and continue the construction 

1 1 -^Pmax 1 

until we reach an S £ with \S e \ > (1 - <5)ra. Note that £ < ^f^- = 0(1). 

We shall now uncover the "sprinkled" edges between T% and 5t-i, working 
backwards from Tg. It will be convenient to set To = So, so St = Ut'=o^*'- 
Since \Sg \ > (1 — <5)n, while |Cy| > 25n, the set S^ contains at least Sn vertices 
from Cy. Since So = T> = Cx is disjoint from Cy, it follows that there is 
some toi 1 < *o 5; ^> f° r which T contains a set Yo of at least 5n/£ vertices 
of Cy. Passing to a subset, we may assume that 

|r » l=mm {-'i(vu:} +0(1) - 

so \Y Q \ = 0(n) but |y | < 6n/( 10 /3maic) < |3*|/5 for 1 < t < £. 

Next, we construct a set Xq C St _i with |Xo| > 56|lo|/5 such that every 
x G Xo is joined to some y 6 Yo by an edge of G n (l/n) \ G' . We start with 
the observation that for every vertex y £Yq we have e(y, St -i) > bn/2, 
since y € T t() . Hence the expected number of edges of G n (l/n) \ G' from ?/ 
to 5t _i is at least 5b/2, and the probability that at least one such edge 
is present is at least 1 — exp(— 5b/2) > 56/4. Furthermore, this conclusion 
remains true (with 5b/2 replaced by (56/3) even if we exclude a subset of 
St -i of size at most |Yo|, corresponding to at most one neighbor x' G <St -i of 
each vertex y' G Yo previously considered. [To see this, we note that for every 
X C S to -i with |X | < |Yb|, we have e(y,S to ~i \ X ) > bn/2 - /3 max |Y" | > 
bn/2 — 6n/10 > bn/3.] Using independence of edges from different vertices 
y, and the concentration of the binomial distribution, it follows that with 
probability at least 1 — exp(— 0(n)), we find a set Xq of at least <56|Yo|/5 
vertices of St -i such that every x G Xq is joined to some y G Yo by an edge 
of G n (l/n)\G'. 

Since \Xq\ > 5b\Yo\/5, there is some t± < tQ such that Y\ = Xq n T tl con- 
tains at least Sb\Yo\/(5£) vertices. If t\ > 1 then, arguing as above, with prob- 
ability 1 — exp(— @(n)) we find a £2 and a set Y2 of at least <5 2 6 2 |Yo|/(5^) 2 
vertices of T 2 joined in G n (l/n) to Y\, and so on. As the sequence to, t\, . . . 
is decreasing, this process terminates after s < I steps with t s = 0. Hence, 
with probability 1 — exp(— 0(n)) we find a set Y s of at least (5b / (5£)Y \Yq\ = 
O(n) > 1 vertices of Tq = Sq = Cx joined in G n (l/n) by paths to vertices 
in Cy. In particular, the probability that there is no path in C n (l/n) from 
Cx to Cy is exponentially small. 

Recalling that r <n/uj = o(ri), the number of possible partitions X , Y of 
the components C\,...,C r is at most 2 r = exp(o(n)), so the probability of 
the bad event B is o(l), as required. □ 
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4. Stronger bounds on the small components. In this section we prove 
Theorem 3, considering the subcritical and supercritical cases separately. 

4.1. The subcritical case. We start by proving the first statement of The- 
orem 3, restated below for ease of reference. 

Theorem 8. Let (G n ) = (Pij(n))ij£[ n ] be a sequence of weighted graphs 
with \G n \ = n and /3 max = sup 4 j n (n) < co converging to a graphon W. If 
\\Tw\\ < 1, then there is a constant A such that Ci{G n {\/n)) < ^logn holds 
whp. 

Proof. Let 6 be a positive constant chosen so that (1 + £)||2V|| < 1. 
Let W n = Wc n be the piecewise constant graphon naturally associated to 
G n , and let W' n = (1 + S)G n . We claim that if n is large enough, then the 
neighborhood exploration processes associated to a random vertex of G n 
may be coupled with £yy> , viewed as an n-type branching process, so that 
the latter dominates. 

In the exploration process, we start with a random vertex of G n . Having 
reached a vertex i, we check for possible "new" neighbors of i not yet reached 
from other vertices. The chance that j is such a new neighbor is either 
j3ij{n)/n or 0, depending on whether j has been previously reached or not. 
In particular, this process is dominated by (may be regarded as a subset of) 
a binomial n-type process in which we start with a particle of a random type, 
and each particle of type i has a Bernoulli B(/3jj(ra)/n) number of children of 
type j, independently of everything else. The process ma Y De described 
in exactly the same terms except that the number of children of type j has 
a Po((l + 5)/3ij(n)/n) distribution. As f3ij(n) is uniformly bounded and 5 
is fixed, this Poisson distribution dominates the corresponding Bernoulli 
distribution for all large enough n. 

Although the branching processes £\yr have different kernels, these ker- 
nels are uniformly bounded. Furthermore, since W and the W n are uniformly 
bounded and W n — > W in the cut norm, it is easy to check (for example, by 
considering step function approximations to eigenfunctions of the compact 
operators T\y n and T\y) that ||^V n || — >• ||1V||. (In fact, as mentioned in the 
Introduction, much more is true — in [15] it is shown that the normalized 
spectra converge.) Hence, for all sufficiently large n, the norms \\Tw^ || are 
bounded by some constant strictly less than 1. It is a standard result that 
the branching processes 3lw^, associated to uniformly bounded, uniformly 
subcritical kernels W' n exhibit uniformly exponential decay; in other words, 

(17) P(|£wJ >&) <exp(-a/c) 

for all sufficiently large n and all k > 1, where a > is constant. [This can 
be shown by considering E(e Wn )-] 
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Finally, from the coupling above, the expected number of vertices of 
G n (l/n) in components of size at least Alogn is at most nP(|j£^ | > Alogn); 
for A = 2/a, say, (17) tells us that this is at most reexp(— logn) = 1/n = 
o(l), so whp there are no such vertices. □ 

4.2. The supercritical case. In the supercritical case we shall show that 
there is a constant A = A(W) such that whp the second largest component 
of G n {l/n) contains at most Alogn vertices. More precisely, we shall prove 
the second part of Theorem 3, that is, the following result. 

Theorem 9. Let (G n ) = (/3ij(n))j je [ n ] be a sequence of weighted graphs 
with \G n \ = n and /3 max = sup 4 „- n (n) < 00 converging to a graphon W. 
Suppose that ||Tw|| > 1 and that W is irreducible. Then there is a constant A 
such that C2(G n (l/n)) < Alogn holds whp, where C%{G) denotes the number 
of vertices in the second largest component of a graph G. 

The basic strategy of the proof is to use an idea from [6] , although there 
will be considerable difficulties in adapting it to the present context. Let us 
first give a rough description of this idea. Note that we have already shown 
in Theorem 1 that G n (l/n) has whp a unique component of order 0(n), 
the "giant" component. All other components are "small," that is, of order 
o p (n). 

Suppose that we have a "supercritical" random graph H on n vertices 
[here H = G n (l/n)], and let k be a large constant to be chosen later. Let us 
select n/k of the vertices of H at random to be "left" vertices, the remaining 
vertices being "right" vertices; we do this before deciding which edges are 
present in the random graph H. If H has probability e of containing a 
small component with at least Aklogn vertices, then [considering a random 
partition of V(H) into k parts] with probability at least e/k the graph H 
contains a small component that in turn contains at least Alogn left vertices. 
Thus, it suffices to show that whp any component of H containing at least 
Alogn left vertices is the unique component of H with size 0(n). 

If k is chosen large enough, then the subgraph induced by the right vertices 
already contains a component of size 0(n). Uncovering the subgraphs Hr 
and Hi of H induced by the right and left vertices, respectively, and all 
edges between the small components of Hr and the left vertices, we have 
already revealed the small components of H: writing H' for the subgraph of 
H formed by the edges revealed so far, any small component of H is a small 
component of H' . All that remains is to uncover the edges of H between the 
unique giant component of Hr and the left vertices; adding these edges to 
H' will cause certain components of H' to merge into the giant component 
but have no other effect. If the edge probabilities in H are bounded below 
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by c/n, c > 0, and A is chosen to be large enough, then it is very unlikely 
that any component of H' with yllogn left vertices fails to merge into the 
giant component, so it is very unlikely that H has a small component with 
at least Alogn left vertices. 

This argument can be applied as it is to H = G n (l/n) if the edge weights 
(3ij(n) in G n are bounded away from zero. However, this is typically not the 
case. Indeed, the main interest is when G n is a graph rather than a weighted 
graph, so many edge weights will be zero. To overcome this difficulty we shall 
use regularity instead of a lower bound on individual edge probabilities. 

Our notation for regularity is standard. Thus, for disjoint sets A, B of 
vertices of a weighted graph G, we write e(A,B) = ec(A,B) for the total 
edge weight from A to B in G. Also, d(A,B) = e(A, B)/{\A\\B\) is the density 
of the pair (A,B). An e-regular pair is a pair (A,B) of sets of vertices of G 
such that 

\d(A',B')-d(A,B)\<e 

whenever A' C A and B' C B with \A'\ > e\A\ and \B'\ > e\B\. A partition 
V of V(G) into sets Pi,...,Pm is e-regular if |Pj| = [n/M\ or \n/M] for 
every i, \Pi\<en, and all but at most eM 2 of the pairs (Pi,Pj), z 7^ j, are 
e-regular. Szemeredi's Lemma [21] tells us that, for any e > 0, there exist 
M = M(e) and no = no(e) such that any graph G with n > uq vertices has 
an e-regular partition V into exactly M classes. (Most formulations give at 
most M classes; applying this weaker form with a slightly smaller e, and 
then randomly partitioning classes into smaller classes of the desired size, 
it is easy to deduce the stated form.) Note that while Szemeredi's original 
lemma applies to unweighted graphs, the extension to graphs with bounded 
edge weights is immediate (either by adapting the proof, or by first replacing 
all edge weights with multiples of e/100, say). 

Given a Szemeredi partition as above, we write G/V for the weighted 
graph with vertex set P\, . . . ,Pm in which the weight of the edge P%Pj is 
d(Pi,Pj) if (Pi,Pj) is e-regular, and zero otherwise. If the edge weights in 
the graph G are bounded by /3 max < 00, then it is easy to check that the cut 
metric distance 8n(G,G/V) is at most 4e(/3 max + 1), say. 

For the rest of the paper we fix G n with G n — > W, /3 max < 00, W irre- 
ducible and ||Tw|| > 1. Note that \\Tw\\ < /3 max , so /3 max > 1. We also fix a 
constant k > 2 with (1 — l/fe) 2 ||Tv[/|| > 1. Let L be a set of n/k vertices of 
G = G n chosen uniformly from among all such sets, and set R = V(G) \ L: 
these are the sets of left and right vertices. Here and throughout we ig- 
nore rounding to integers, which clutters the notation without affecting the 
proofs. 

In what follows we shall consider several different graphs defined in terms 
of G, L and R, some random and some not. We write Gl and Gr for the 
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subgraphs of G = G n induced by L and R, respectively. We use a superscript 
minus to denote graphs in which the edge weights have been multiplied by 
(1 — 1/k) (i.e., reduced slightly), and we replace G by H to denote the 
random "subgraph" of a weighted graph (67, G R , etc.) obtained by selecting 
each edge ij with probability given by its weight divided by n. In particular, 
we shall consider the following graphs at various stages: 

G = G n , 
H = G n (l/n), 
Gr = G n [R] , 

H R = H[R] = G n (l/n)[R], 
H L = H[L]=G n (l/n)[L), 
G R = (1-1/ k)G n [R], 

H R = (1- l/k)G n [R](l/n) = G n [R]((l - l/k)/n) 

as well as 

C\ = largest component of Hr 

and 

C-f = largest component of H R . 

We shall also consider the graph H' defined as in the outline proof above: 
H' will be the subgraph of H consisting of Hr U Hl together with all edges 
of H joining L to R\C\. 

Lemma 10. Let G n , W , k > 2, L and R be as above. Then, with proba- 
bility 1, Gr — > W . Furthermore, for any 5 > the largest component C± of 
satisfies 

\Ci\/n > (1 - 1/Jfe)p((l - 1/kfW) - 5 

whp. 

Proof. The first statement is a more or less immediate consequence 
of Szemeredi's regularity lemma; indeed, using this lemma and a simple 
sampling argument, one easily shows that 5\j(G, Gr) — > in probability (see 
Theorem 2.9 of [14] for an explicit bound on the error term). Although 
convergence in probability is all we need here, the error probability can be 
made small enough to ensure convergence with probability 1. Since G — > W, 
we then have Gr — > W. 
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The second part follows by Theorem 1, noting that G R has n' = (1 — l/k)n 
vertices, so H R = G^(l/n) = G^((l — l/k)/n'), and that Gr —> W trivially 
implies G^ -> (1 - l/fc)W. □ 

We next note that irreducibility of W implies that G cannot have too 
many "low-degree" vertices. The constants in this lemma are not written in 
the simplest form, but rather in the form that we shall use later. 

Lemma 11. Let G n — > W with W irreducible and /3 max = supj j n /3y (re) < 
oo, and let k > 2 be constant. There is a constant a > such that, for all 
large enough n, at most n/(10/3 max ) vertices of G n have weighted degree less 
than 50fc/3 max cm. 

Proof. Set a = l/(10/3 max ). By Lemma 7 there is a a such that W 
has no (a, 6/cc)-cut. Since G n — > W, it follows that, for large enough n, 
the graph G n has no (a, 5fccr)-cut. But then the conclusion of the lemma 
follows: otherwise, let A be a set of exactly n/(10/3 max ) vertices of G n with 
d v < 50k(3 maxC/i for every v € A. Then e(A,A c ) < 50A;/3 max (Tri|^4| — 5ko~n , 
so (A,A C ) is an (a, 5fccr)-cut in G n . □ 

From now on, we fix a constant a > for which Lemma 11 holds. We 
assume, as we may, that 

(18) a<10- 10 /^ ax 
and that 

(19) a <(l-l/k)p((l-l/k) 2 W)/3. 

Our next lemma shows that when we split G = G n into left and right 
vertices, most vertices on the left will send a reasonable weight of edges to 
Ci, the giant component of the random subgraph on the right, that is, the 
largest component of Hr = Gj?(l/n) = G n (l/n)[R\. By Lemma 10 and our 
assumption (19), we have 

\Ci\ > \Ci\>2an 

whp. Also, by Theorem 1, C\ is whp the unique component of Hr with 0(n) 
vertices. 

Lemma 12. Under the assumptions above there is a constant 7 > such 
that whp the number of vertices v G L with ec(v,Ci) < jn is at most 3an. 

Proof. The proof of this technical lemma is somewhat involved. The 
basic idea is to take a Szemeredi partition and use regularity. The problem 
is that a priori we have no control over the distribution of C\ with respect 
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to this partition: it could sit almost entirely within a few parts, and there 
might be many other parts with low density to these parts. The solution is to 
start from C-j~ and use irreducibility to show that the extra sprinkled edges 
expand C-f to a set (Ci) that has a decent number of vertices in almost all 
parts of the partition. 

By Lemma 7, there is a constant b > such that W has no (a, 3fo)-cut. 
We may and shall assume that b < 1/1000, say. Recalling that /3 max > 1, set 



and apply Szemeredi's lemma to find, for all sufficiently large n, an e-regular 
partition V = V n of the weighted graph G = G n into M classes P±, . . . , Pm ■ 
As the partition V depends on G only, not on the random choice of L, whp 
each class Pj satisfies |Pj fl L\ = n/(kM) + o(n). Also, whp every vertex v 
satisfies ec(v,L) = d v /k + o(n). From now on we condition on L, assuming 
these two properties. 

Let G/V be defined as above, noting that 8a(G/V, W) < Ce where C = 
4(/3 max + 1) is constant. As b < 1/1000, we have Ce < b. Hence, if n is large 
enough, which we assume from now on, the graph G/V has no (a, 26)-cut. 

Recall that is the graph on R in which each edge is present indepen- 
dently, and the probability that ij is an edge is (1 — l/k)f3ij(n)/n. Note that 
the subgraph Hr = G(l/n)[R] of H = G n (l/n) induced by the vertices in R 
may be obtained from by "sprinkling," that is, by adding each nonedge 
ij of with probability (a little larger than) f3ij/{kn). Let Cj~ be the 
largest component of H^. As noted above, by Lemma 10 and our choice of 
a, we have \C^\ > 2cm whp. From now on we condition on H^, assuming 
that this holds. 

Let S' be the set of classes Pi with \C^ fl Pj| > an/M. As 

2an < |Cf | < (n/M)\S' \ + (an/M)(M - \S' \) < (n/M)\S' \ + an, 

we have |5g| > aM. Let So be an arbitrary subset of S' of size exactly 
aM (ignoring rounding, as usual). We shall inductively define an increasing 
sequence Sq C S\ C • • • C 5^ of sets of classes of V, stopping the first time we 
reach an St with \St \ > (1 — a)M . In doing so, we shall write Tj for Si \ Si-\. 

Having defined Si, with aM < \Si\ < (1 — cr)M, let Tj + i be the set of 
classes Pj with e(Pj,Si) > bM, where e(-,-) counts the weight of edges in 
the graph G/V. As G/V has no (cx,26)-cut, we have e(Sf,Si) > 2bM 2 . Since 
e{Pj,Si) < P max \Si\ < AnaxAf for every j, it follows that |T i+ i| > 6M//3 max . 
Set Si + i = Si U T; L+ i and continue until we reach Si with \S(\ > (1 — o)M. 
Since there are only M classes in total, and |Tj| > 6M//3 max for every i, the 
process just defined stops after I < /3 max /6 steps. 
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Claim 13. Whp every class Pj in Si contains at least en/M vertices of 
C\, the giant component of Hr. 

To show this, we shall use induction to prove the stronger statement 
that, whp, every class Pj G Tj contains at least Qn/M vertices of a certain 
set C[ C C\ that we shall define in a moment, where 



and it is convenient to set To = So . 

Set C'q = Ci , so the base case i = holds by the definition of So = T$. 
In proving the induction step, we shall use the "sprinkled" edges between 
Tj + i n R and Si PI R; we define C' i+l to be the set obtained from C[ by adding 
all vertices of Tj +1 joined directly to C[ by such sprinkled edges. 

For the induction step, let P G T + i be a class of the partition V. By 
definition of T + i, we have e(P, Si) > bM, where e(-, •) counts the weight of 
edges in G/V. Since e(Pj,Pk) < /3 m ax for every j, k, it follows that there 
is a set Q of least 6M/(2/3 max ) classes Pj G with e(P,Pj) > 6/2 for every 
Pj G Q- By definition of G/P, each pair (P,Pj), Pj G Q, is e-regular (in 
G n ) with density at least 6/2. Set P' = Up^gQ ^j- From standard properties 
of e-regularity, it follows that the pair (P, P') is (2e)-lower regular with 
density at least 6/4. By the induction hypothesis, each Pj G Si contains at 
least an/M = a\Pj\ vertices of C[. Hence |P' n C[\ > Ci\P'\ > 2e\P'\. By 
regularity, it follows that 



(20) \{v eP:e G (v,C'i HP') < 6|C- n P'|/5}| <2e\P\ = 2en/M. 

In particular, of the (1 + o(l))(l — l/k)n/M vertices of P D R, at least 
N = nj (3M), say, have 



where for the last inequality we used \Q\ > 6M/(2/3 max ). 

A standard calculation using concentration of the Binomial distribution 
implies that, whp, at least Ncib 2 / (llkf3 mSLK ) = a+\\P\ of these vertices v are 
joined to C- by a sprinkled edge, so \C' i+1 fl P\ > Cj + i|P|, completing the 
induction argument and hence (as C 4 ' C Ci for all i) proving the claim. 

The proof of Lemma 12 is also essentially complete. Set 7 = q6 2 / (10/3 max ) . 
Since |5o|, |5"^| < aM, it suffices to prove that, whp, for each P G Tj, 1 < 
j < £, there are at most a\P\ vertices u of P fl L with eo(v, C±) < 771. Since 
C[ C C\ for every i, while q > c? and 2e < cr, this is immediate from (20) 
and (21). □ 




(21) 



e G (v, Ci) > e G (v, Ci n P') > b\C[ D P'\/5 > q6|P'|/5 
> Ci 6 2 n/(10/3 max ), 
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In proving Theorem 9, we shall first uncover all edges of H = G n (l/n) 
between vertices in R. In addition to revealing the giant component C\ of 
Hr = G n (l/n)[R], this also reveals the small components of Hr. It will turn 
out that certain small components cause difficulty in the proof. Let us say 
that a vertex v of G = G n has low degree if e(v, L) < 49/3 max cm, where a is 
the constant in Lemma 11. We write R~~ C R for the set of low-degree vertices 
in R. Note that from Lemma 11 and the randomness of our partition, we 
have 

|iT|<n/(10/3 max ) 

whp. Let us say that a component C of Hr is annoying if |Cnii - | > |C|/2. 

Lemma 14. Let A s be the number of vertices of Hr in annoying s-vertex 
components. Then whp we have A s < nexp(— s/200) for every s > 1. 

Proof. Throughout we confine our attention to the subgraph of G in- 
duced by the vertices in R. We shall condition on R, assuming, as we may, 
that \R~\ < ro/(10/3 max ). For any s we have A s < 2|i?~| < n/(5/3 max ) < n/5, 
so we may assume that s > 200, say. 

Let v be a random vertex of R, and let C v denote the component of Hr 
containing v. We shall first show that, for some constant a > 0, we have 

Ps = F(\C v \=s,\C v nR~\ > |C w |/2)<exp(-as). 

Note that 

(22) p s <^(\C v \>s,\C v nR + \<s/2), 

where R + = R \ R~ . Let us explore the component C v in the usual way, 
writing v\ , Vi , . . . , Vt for the vertices of C v in the order we reach them, so 
v\ = v. For technical reasons, we continue the sequence (vi) by starting a new 
exploration at a new random vertex whenever we exhaust the component 
currently being explored. 

If \C V \ > s and \C V n < s/2, then at least s/2 — 1 of the vertices 
V2, ■ ■ ■ ,v s are in R~ . In particular, at least s/2 — 1 of the children of v\, . . . ,v s 
are in R~ . Let It denote the number of children of vt that are in R~ . Then 

p.<¥^J2£t>8/2-lj. 

Now, as \R~\ < ra/(10/3 max ), for any vertex w we have eG{w,R~) < n/10. In 
particular, when we test edges from a vertex vt to vertices not yet reached 
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in the exploration, the chance of finding more than k edges to R in the 
random subgraph Hr = Gr(1/ti) of Gr is at most 10~ fc . Hence, 

£ (TV 1 ) 10 "' 

r>s/2-l x 7 

with the first factor coming from the number of sequences k±,...,k s with 
ki > and ^2 ki = r. A simple calculation shows that p s < exp(— s/100) for 
all s > 200, say. 

Let a = 1/100, and fix an s > 200. Then E(A S ) = \R\p s < nexp(— as), 
and it is easy to show that V&r(A s /n) = o(l). [This follows easily from the 
observation that the probability that two fixed vertices v and w are in the 
same component of size s is o(l), while for disjoint sets X and Y of s 
vertices, the events that X and Y are the vertex sets of the components 
containing v and w are almost independent.] Setting c = a/2 = 1/200, it 
follows that A s < nexp(-cs) holds whp for each fixed s. Hence, there is some 
so(n) tending to infinity such that whp the bound A s < nexp(-cs) holds 
simultaneously for all s < sq. For s > sq we simply use Markov's inequality 
to note that 

F(3s>s :A s >nexp(-cs)) < ^ P(^ s > nexp(-cs)) 

s>s 

^ nexp(— as) 
~ ^ nexp(— cs) 

s>s 

= exp(— cs) — > 0. 

s>s n 



We are finally ready to prove Theorem 9. 



Proof of Theorem 9. Let G n —> W be a sequence of weighted graphs 
satisfying the assumptions of the theorem, and define k, L, R and a as above. 
As before, we write H = G n (l/n) for the random graph whose component 
distribution we are studying. 

As noted earlier, if H has probability e of containing a small component 
with at least Ak log n vertices, then with probability at least e/k, H has 
such a component containing at least Alogn vertices from L. By Theorem 
1, H has whp a unique component with 0(n) vertices, so it suffices to prove 
that, for some constants A and 5 > 0, whp every component of H containing 
at least Alogn vertices of L has size at least 5n; we shall prove this with 
5 = 2a and A a (large) constant to be chosen later. 

As above, define C\ to be the largest component of Hr = H[R\. Let H' 
be the subgraph of H consisting of all edges within R, all edges within L 



PERCOLATION ON DENSE GRAPH SEQUENCES 



31 



and all edges between L and vertices of R \ C\ . As H is formed from H' 
by adding some edges between C\ and other components, whp every small 
component of H is a component of H'. In particular, it suffices to prove that 
whp every component C of H' with at least A\ogn vertices in L is joined 
to Ci in H. 

Let 7 be as in Lemma 12 above. For the rest of the proof we condition on 
Hr, assuming as we may that the property described in Lemma 14 holds. 
Call a vertex v € L bad if ec(v, C\) < jn, and let B C L be the set of bad 
vertices. By Lemma 12, we may assume that \B\ < 3an. 

It suffices to prove the following claim, which should be taken to hold 
conditional on Hr, under the assumptions above. 

Claim 15. Let v be a random vertex of L, and let C v be the component 
of H' containing v. There is a constant c>0 such that the (conditional) 
probability that \C V fl L\ = s and \C V fl B\ > \C V fl L\/2 is bounded above by 
exp(— cs) whenever < s < an. 

Indeed, to deduce Theorem 9 from Claim 15, note that if \C V fl L\ = s > 
Alogn and \C V C\B\ < \C v C\L\/2, then when we uncover the (so far untested) 
edges between L and C\, the probability that there is no edge from C v to 
C\ in H is bounded by 

exp(-7|C t , fl (L\ B)\) < exp(-s7/2) < exp(-A7logn/2), 

which we can make o(l/n 2 ) by choice of A. On the other hand, from Claim 
15, for Alogn < s < an we have 

F(\C V fl L\ = s, \C V CiB\> \C V n L|/2) < exp(-cs) < exp(-Aclogn), 

which we can again make o(l/n 2 ) by choice of A. Hence, summing over 
Alogn < s < an, the probability that v is in a small (size at most an, say) 
component of H' containing at least Alogn vertices of L but not joined to 
C\ in H is o(l/n), and the probability that such a vertex exists is o(l). As 
noted above, this suffices to prove the theorem. 

It remains to prove Claim 15. Recall that we have already conditioned 
on Hr; in particular, we have revealed all edges of H' between vertices in 
R. It remains to reveal the edges of H' between vertices in L, and between 
vertices in L and vertices in R\C\. (By definition of H' , there are no edges 
of H' between L and C\.) Let v = v\ be a random vertex of L. We shall 
explore the component C v of v in H' in the following way: having "reached" 
vertices v%, . . . ,v r G C v fl L and "tested" v\,. . . , Vt-i, t < r, we next "test" 
vertex vt- First, we add any neighbors (in the graph H') of Vt in L not among 
the vertices reached so far to our list of reached vertices. Then we test edges 
between vt and R\C\, finding the set of components of Hr that vt is joined 
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to in H' . For each we find its unreached neighbors in L and add them to 
our list. 

The basic idea of the proof is simple, and similar to that of Lemma 14: 
roughly speaking, at each step we are much more likely to reach a good vertex 
of L than a bad vertex. It will follow that the chance that \C V H L\ = s but 
that the component C v contains at least s/2 bad vertices is exponentially 
small in s. The problem is that there is an exceptional case: when we reach 
an annoying component on the right, this may send many more edges in H' 
to bad vertices than to good ones. But an annoying component of size s' is 
very unlikely to have more than s' bad neighbors, and the chance of reaching 
such a component will be exponentially small in s', so the contribution from 
such annoying components is negligible. Turning this into a formal proof is 
now a matter of accounting. 

Fix s < an, and recall that we must show that 

F(\C V n L\ = a, \C V n B\ > s/2) < exp(-ca). 

Let us define a quantity ft associated with the testing of vertex vt- set 

/t = exp(6- g/2 -1/8)1^, 

where b and g are the number of new good and bad vertices of L that we 
reach when testing vt, and is the indicator function of the event E% that 
after testing vt we have reached at most s vertices in C V H L. If t > \C V \, so 
there is no vertex i>t to test, set ft = 0. The role of the indicator function lg t 
is simply to stop our exploration process if we reach more than a vertices in 
C v n L, at which point there is nothing to prove. 

Set F s = U s t=1 f t . If \C V HL\ = s and \C V DB\> s/2, then 

F s = exp(\(C v \ {v}) HB\-\(C V \ {v}) n (L \ B)\/2 - a/8) 

> exp(s/2 - 1 - a/4 - s/8) = exp(s/8 - 1) > exp(s/9), 

assuming, as we may, that a > 100. Hence Claim 15 follows if we show that 
E(i^ s ) < 1. In turn, it suffices to show that, conditional on the exploration 
so far, we have E(/t) < 1 for each t. 

Let the small components of Hr be Ci, . . . , C m . We shall test v t in several 
steps. In step we check for edges from vt to unreached vertices in L. In 
step i, 2 < i < m, if d has not previously been reached by our exploration, 
we check for edges from v± to Q, and, if we find such an edge, then test for 
edges from C, to unreached vertices in L. At every step we assume, as we 
may, that we have reached at most s vertices in L; we shall suppress the 
corresponding indicator functions in the estimates below. 

We may write ft as a product of a factor f[ (that also depends on t) for 
each step i; up to indicator functions corresponding to l^ t , we may write 

/^ = exp(6 -<7o/2-l/8) 
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and 

f'i = exp(6j - gi/2) 

for i = 2, . . . ,m, where 6j and gi are the number of new bad/good vertices 
reached in step i. (There is no step 1 as we do not test for edges to C\.) 

Since \B\ < 3an and, from (18), a < l/(600/3 max ), for any v € V(G) we 
have ec{v,B) < n/200. Hence, writing B' for the set of vertices in B not so 
far reached, conditional on everything so far, we have 

E(exp(6 )) = ]J(l + (e- 1)/W") 
weB' 

< ] [ exp((e - l)Pv tW /n) 

(23) 

= exp((e - l)e G (v t ,B)/n) 
<exp(l/100). 

Using only g > it follows that E(/jJ) < E(exp(6 )) exp(-l/8) < exp(-l/10). 

Now let us condition not only on the results of testing v±, . . . ,vt-i, but 
also on steps 0, 2, — 1 of the testing of vt, assuming as we may that we 
have reached at most s < an vertices of C V DL. Let F{ be the event that we 
find an edge from vt to Cj. 

Suppose that Cj is not annoying. For any vertex v € R \ R~ , the total 
weight of edges from v to unreached vertices in L is at least 

(24) e G (v, L) - /3 max s > 49/3 maxl rn - /3 ma x<™ > 48/3 max cjn. 

On the other hand, for any vertex v, 

max 

< 3/3 max o-n. 

As \Ci nR~ \ < \Ci\/2, the total weight w of edges of G from Cj to unreached 
vertices in L \ B is at least 21 (3 max an\Ci\. A similar calculation to (23) but 
now using 1 — p(l — e -1 / 2 ) < exp(— (1 — e _1 / 2 )p) < exp(— p/3) shows that 
E(exp(— gi/2) \ F-i) < exp(— 7/3 max o"|Cj|). [Of course, we should update the 
set of reached vertices as we go, but (24) is valid at every step.] Arguing as 
for (23), since the total weight of edges of G between C, and B is at most 
3/3 max an|Ci|, we have E(exp(6j) | F h gi) < exp(6/3 max a|Ci|), so E(// | F) < 1- 
Since f- = 1 whenever Fi does not hold, it follows that the (conditional) 
expectation of f- is at most 1. 

Finally, suppose that Cj is an annoying component of size s'. Using the 
bound ea(v,B) < f3 max \B\ < 3/3 max <rn to bound E(exp(6j)) as above, and 
using gi > 0, we have E(/| | Fi) < exp(6/3 max fjs'). Since P(i^) < /3 max |Ci|/n = 
Pmaxs' /n, it follows that 

WD < 1 + ^^(exp(6/3 maxf 7 S / ) - 1). 
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Since a < 10 10 //3 max > for s' < 10 6 /3 max , say, the bracket above is at most 
l/(100/3 max ), so 

log(E(//)) < . 
For s' > 10 6 /3 max we use the very crude bound 

log(E(//)) < E(fi) - 1 < exp(6/3 max ^')- 

n 

Recall that A s > counts the number of vertices in annoying s'- vertex compo- 
nents, and that each such component contains s' vertices. As all expectations 
are conditional on everything preceding them, we can multiply the expecta- 
tions above together to conclude that 

Iog(E(/ t )) < ~ + £ 



10 V WOn 

s'<10 6 /3 m ax 

+ > exp(6/J max c7s ). 

n 

s'>10 6 /W 

By assumption, A s > < exp(— s'/200)n for every s' , while ^j-Aj — n - Hence 
log(E(/ t )) < -— + — + ^ax exp(6/3 max as / - s'/200) 

s'>10 6 /3 m ax 

< -0.09 + Yl exp(-s'/400) < 0. 

s'>10 6 /3 m ax 

In other words, E(/t) < 1. Recalling that the argument above, and hence 
the final estimate, hold conditional on all previous steps in the exploration, 
it follows that 

As noted earlier, this implies Claim 15 and hence Theorem 9. □ 

The proof of Theorem 9 presented above is rather involved, and the reader 
may well wonder whether it can be simplified. While writing this paper, we 
found various proofs of Theorem 9 that were indeed much simpler; unfortu- 
nately they were also incorrect. Of course, there may well be a simple proof 
that we have missed. If so, it would be interesting to find one. 
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